Active Frame Selection for Label Propagation in Videos

نویسندگان

  • Sudheendra Vijayanarasimhan
  • Kristen Grauman
چکیده

Manually segmenting and labeling objects in video sequences is quite tedious, yet such annotations are valuable for learning-based approaches to object and activity recognition. While automatic label propagation can help, existing methods simply propagate annotations from arbitrarily selected frames (e.g., the first one) and so may fail to best leverage the human effort invested. We define an active frame selection problem: select k frames for manual labeling, such that automatic pixel-level label propagation can proceed with minimal expected error. We propose a solution that directly ties a joint frame selection criterion to the predicted errors of a flow-based random field propagation model. It selects the set of k frames that together minimize the total mislabeling risk over the entire sequence. We derive an efficient dynamic programming solution to optimize the criterion. Further, we show how to automatically determine how many total frames k should be labeled in order to minimize the total manual effort spent labeling and correcting propagation errors. We demonstrate our method’s clear advantages over several baselines, saving hours of human effort per video.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supplementary Material for “ Match Graph Construction for Large Image Databases ”

This appendix presents additional discussion on several aspects of the proposed algorithm. Sec. A presents a modification of our algorithm which enables users to reflect local connectivity in link prediction. The remaining sections focus on the label propagation application. Sec. B and C discuss functionalities of active label acquisition and adding new images to the match graph while Sec. D di...

متن کامل

Active selection with label propagation for minimizing human effort in speaker annotation of TV shows

In this paper an approach minimizing the human involvement in the manual annotation of speakers is presented. At each iteration a selection strategy choses the most suitable speech track for manual annotation, which is then associated with all the tracks in the cluster that contains it. The study makes use of a system that propagates the speaker track labels. This is done using a agglomerative ...

متن کامل

Adaptive Frame Selection for Enhanced Face Recognition in Low-Resolution Videos

Adaptive Frame Selection for Enhanced Face Recognition in Low-Resolution Videos by Raghavender Reddy Jillela Master of Science in Electrical Engineering West Virginia University Arun Ross, PhD., Chair Performing face detection and recognition in low-resolution videos (e.g., surveillance videos) is a challenging task. To enhance the biometric content in these videos, imagelevel and score-level f...

متن کامل

Nonparametric Video Retrieval and Frame Classification using Tiny Videos

A nonparametric video retrieval and frame classification systm that uses affinity propagation algorithm is proposed. The main goal of the proposed system is to develop "tiny video" that achieves high video compression rates while retaining the overall visual appearance of video. The proposed video retrieval system utilizes the strengths of affinity propagation algorithm that uses exem...

متن کامل

Dynamic facial expression recognition using a be- havioural model

A recent interest appears in transportation for users emotion recognition. This permits to adapt car behaviors to drivers mood for safety reasons, or improve public transportation offers. Human emotions are complex and defined by several elements, such as voices intonations or facial expressions. We propose a new dynamic facial expression recognition framework based on Discrete Choice Models (D...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012